Cake: a bioinformatics pipeline for the integrated analysis of somatic variants in cancer genomes
Authors
Abstract
We have developed Cake, a bioinformatics software pipeline that integrates four publicly available somatic variant-calling algorithms to identify single nucleotide variants with higher sensitivity and accuracy than any one algorithm alone. Cake can be run on a high-performance computer cluster or used as a stand-alone application. Availability: Cake is open-source and is available from http://cakesomatic.sourceforge.net/
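As a rough illustration of what integrating several callers can mean, the sketch below keeps a single nucleotide variant only when a minimum number of callers report it. This is a minimal example under stated assumptions, not Cake's actual implementation: the function name `consensus_variants`, the `min_callers` threshold, the caller labels and the variant tuple layout are all hypothetical, and the abstract does not say how Cake combines its four algorithms.

```python
from collections import Counter
from typing import Dict, Iterable, List, Tuple

# Hypothetical variant key: (chromosome, 1-based position, ref allele, alt allele).
Variant = Tuple[str, int, str, str]


def consensus_variants(
    calls_by_caller: Dict[str, Iterable[Variant]],
    min_callers: int = 2,  # assumed threshold, not taken from the Cake paper
) -> List[Variant]:
    """Return variants reported by at least `min_callers` distinct callers."""
    support: Counter = Counter()
    for calls in calls_by_caller.values():
        # De-duplicate within a caller so each caller votes at most once per variant.
        for variant in set(calls):
            support[variant] += 1
    return sorted(v for v, n in support.items() if n >= min_callers)


# Toy input with four placeholder callers (labels are illustrative only).
example_calls = {
    "caller_a": [("chr1", 100, "A", "T"), ("chr2", 200, "G", "C")],
    "caller_b": [("chr1", 100, "A", "T")],
    "caller_c": [("chr1", 100, "A", "T"), ("chr3", 300, "C", "G")],
    "caller_d": [("chr2", 200, "G", "C")],
}
shared = consensus_variants(example_calls, min_callers=2)
```

Raising `min_callers` trades sensitivity for fewer false positives, while lowering it to 1 yields the union of all call sets.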
Similar resources
Mutascope: sensitive detection of somatic mutations from deep amplicon sequencing
SUMMARY We present Mutascope, a sequencing analysis pipeline specifically developed for the identification of somatic variants present at low-allelic fraction from high-throughput sequencing of amplicons from matched tumor-normal specimen. Using datasets reproducing tumor genetic heterogeneity, we demonstrate that Mutascope has a higher sensitivity and generates fewer false-positive calls than ...
A geometric approach for classification and comparison of structural variants
MOTIVATION Structural variants, including duplications, insertions, deletions and inversions of large blocks of DNA sequence, are an important contributor to human genome variation. Measuring structural variants in a genome sequence is typically more challenging than measuring single nucleotide changes. Current approaches for structural variant identification, including paired-end DNA sequencin...
Pedigree based DNA sequencing pipeline for germline genomes of cancer families
BACKGROUND In the course of our whole-genome sequencing efforts, we have developed a pipeline for analyzing germline genomes from Mendelian types of cancer pedigrees (familial cancer variant prioritization pipeline, FCVPP). RESULTS The variant calling step distinguishes two types of genomic variants: single nucleotide variants (SNVs) and indels, which undergo technical quality control. Mendel...
VarSim: a high-fidelity simulation and validation framework for high-throughput genome sequencing with cancer applications
SUMMARY VarSim is a framework for assessing alignment and variant calling accuracy in high-throughput genome sequencing through simulation or real data. In contrast to simulating a random mutation spectrum, it synthesizes diploid genomes with germline and somatic mutations based on a realistic model. This model leverages information such as previously reported mutations to make the synthetic ge...
Sequence analysis VarSim: a high-fidelity simulation and validation framework for high-throughput genome sequencing with cancer applications
Summary: VarSim is a framework for assessing alignment and variant calling accuracy in high-throughput genome sequencing through simulation or real data. In contrast to simulating a random mutation spectrum, it synthesizes diploid genomes with germline and somatic mutations based on a realistic model. This model leverages information such as previously reported mutations to make the synthetic ge...